PCR-free library preparation greatly reduces stutter noise at short tandem repeats

Author

  • Melissa Gymrek
Abstract

Over the past several decades, the forensic and population genetic communities have increasingly leveraged short tandem repeats (STRs) for a variety of applications. The advent of next-generation sequencing technologies and STR-specific bioinformatic tools has enabled the profiling of hundreds of thousands of STRs across the genome. Nonetheless, these genotypes remain error-prone, hindering their utility in downstream analyses. One of the primary drivers of STR genotyping errors is “stutter”: artifacts arising during the PCR amplification step of library preparation that add or delete copies of the repeat unit in observed sequencing reads. Recently, Illumina developed the TruSeq PCR-free library preparation protocol, which eliminates the PCR step and should, in theory, reduce stutter error. Here, I compare two high-coverage whole-genome sequencing datasets prepared with and without the PCR-free protocol. I find that this protocol reduces the percentage of reads due to stutter by more than four-fold and results in higher-confidence STR genotypes. Notably, stutter at homopolymers decreased by more than six-fold, making these previously inaccessible loci amenable to STR calling. This technological improvement shows promise for significantly increasing the feasibility of obtaining high-quality STR genotypes from next-generation sequencing technologies.
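
As a rough illustration of the "percentage of reads due to stutter" metric, the Python sketch below assumes that each read at a locus has already been reduced to an observed repeat count and that the true genotype is known. The function names and the toy read counts are hypothetical and are not taken from the study; the abstract does not describe how the metric was actually computed.

```python
from collections import Counter


def stutter_fraction(read_repeat_counts, true_alleles):
    """Return the fraction of reads whose repeat count matches no true allele.

    Assumption: any read that disagrees with the known genotype is counted
    as stutter; other error sources are ignored in this sketch.
    """
    if not read_repeat_counts:
        raise ValueError("no reads at this locus")
    true_set = set(true_alleles)
    n_stutter = sum(1 for count in read_repeat_counts if count not in true_set)
    return n_stutter / len(read_repeat_counts)


def stutter_offsets(read_repeat_counts, true_alleles):
    """Count stutter sizes, in repeat units, relative to the nearest true allele.

    A negative offset means repeat units were deleted; a positive offset
    means repeat units were added.
    """
    true_set = set(true_alleles)
    offsets = Counter()
    for count in read_repeat_counts:
        if count in true_set:
            continue
        nearest = min(true_set, key=lambda allele: abs(count - allele))
        offsets[count - nearest] += 1
    return offsets


# Made-up reads at one homozygous locus (true allele: 10 repeats).
pcr_reads = [10] * 8 + [9] * 2        # hypothetical PCR-amplified library
pcr_free_reads = [10] * 19 + [9]      # hypothetical PCR-free library

pcr_rate = stutter_fraction(pcr_reads, (10, 10))
pcr_free_rate = stutter_fraction(pcr_free_reads, (10, 10))
print(f"PCR: {pcr_rate:.2%}, PCR-free: {pcr_free_rate:.2%}")
```

Comparing the two rates across many loci, rather than at a single locus as here, is one way a fold reduction in stutter could be estimated.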

Similar resources

Characterisation and Filtering of Systemic Noise in Next Generation Sequencing of Short Tandem Repeats with Applications in Forensics

In forensic analysis of DNA samples, short tandem repeat (STR) regions in the genome are selectively amplified with PCR to subsequently determine which alleles are present. The DNA polymerase used to amplify STRs with PCR is known to ‘slip’ occasionally, which results in an additional PCR product having one repeat unit more or less than the original allele. These so-called ‘stutter products’ ma...

Improving sequencing quality from PCR products containing long mononucleotide repeats.

Stutter products are a common artifact in the PCR amplification of frequently used genetic markers that contain mononucleotide simple sequence repeats. Despite the importance of accurate determination of nucleotide sequence and allele size, there has been little progress toward decreasing the formation of stutter products during PCR. In this study, we tested the effects of lowered extension tem...

Development of Highly Polymorphic Pentanucleotide Tandem Repeat Loci with Low Stutter

INTRODUCTION All eukaryotic genomes contain regions of simple repetitive DNA, called short tandem repeats (STR*) or microsatellites, which consist of tandem repeats of a small number of bases (1-3). The number of repeats at a STR locus can be highly variable among individuals, resulting in length polymorphisms that can be detected by relatively simple PCR-based assays. Thousands of these highly...

Microsatellite (SSR) amplification by PCR usually led to polymorphic bands: Evidence which shows replication slippage occurs in extend or nascent DNA strands

Microsatellites or simple sequence repeats (SSRs) are very effective molecular markers in population genetics, genome mapping, taxonomic study and other large-scale studies. Variation in number of tandem repeats within microsatellite refers to simple sequence length polymorphism (SSLP); but there are a few studies that are showed SSRs replication slippage may be occurred during in vitro amplifi...

Publication date: 2016